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Genome-wide association studies [GWAS) identified the MEIS1 locus for Restless Legs Syndrome (RLS), but causal single 
nucleotide polymorphisms (SNPs) and their functional relevance remain unknown. This locus contains a large number of 
highly conserved noncoding regions (HCNRs) potentially functioning as c/5-reguIatory modules. We analyzed these 
HCNRs for allele-dependent enhancer activity in zebrafish and mice and found that the risk allele of the lead SNP 
rs!2469063 reduces enhancer activity in the Meisl expression domain of the murine embryonic ganglionic eminences (GE). 
CREB1 binds this enhancer and rs!2469063 affects its binding in vitro. In addition, MEIS1 target genes suggest a role in the 
specification of neuronal progenitors in the GE, and heterozygous Me/$/-deficient mice exhibit hyperactivity, resembling 
the RLS phenotype. Thus, in vivo and in vitro analysis of a common SNP with small effect size showed allele-dependent 
function in the prospective basal ganglia representing the first neurodevelopmental region implicated in RLS. 



[Supplemental material is available for this article.] 

With an age-dependent prevalence of up to 10% in western soci- 
eties, RLS is a sensorimotor neurological disorder characterized by 
an urge to move the legs and uncomfortable sensations in the 
lower limbs. These symptoms occur exclusively during rest, in 
the evening, or at night, and improve only with movement 
(Szentkiralyi et al. 2011). Therefore, patients are forced to stand up 
and walk, sometimes calling themselves "nightwalkers." Conse- 
quences are severe sleep disturbances, depression, anxiety, and 
increased cardiovascular risk (Winkelman et al. 2008). Severely 
affected patients require medication, with dopaminergic compounds 
being the first line treatment, but side effects such as augmentation 
limit their long-term use (Hornyak et al. 2012). So far, very little is 
known about the underlying neurobiology of this disorder. 

RLS has a heritability of —50% and is a complex genetic dis- 
order. GWAS identified common genetic variants within six risk 
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loci (Winkelmann et al. 2007, 2011; Schormair et al. 2008). These 
convey only small effects on the phenotype; and generally the 
molecular mechanism and function behind low-effect-size com- 
mon genetic variants identified in complex genetic diseases pro- 
vide a formidable challenge. The strongest RLS association signal 
identified delineates a 32-kb linkage disequilibrium (LD) block in 
intron 8 oiMEISl (2pl4). A common haplotype completely tagged 
by a pair of highly correlated intronic SNPs, rs 12469063 and 
rs2300478 (LD r 2 = 0.97), and rs6710341 in this block was shown 
to confer a significantly increased risk for RLS (odds ratio [OR] of 
2.8) (Winkelmann et al. 2007). MEIS1 belongs to the TALE family 
of homeobox transcription factors and is involved in the devel- 
opment and homeostasis of numerous organs and diseases such as 
leukemia or neuroblastoma (Azcoitia et al. 2005). In the nervous 
system, multiple expression domains suggest essential roles for 
MEIS1 (Toresson et al. 2000); however, its spatial, temporal, and 
functional involvement in RLS pathogenesis has remained un- 
known. The MEIS1 locus comprises a cluster of HCNRs, and it has 
been shown that these indicate potential ds-regulatory elements 

© 2014 Spieler et al. This article, published in Genome Research, is available 
under a Creative Commons License (Attribution-NonCommercial 3.0 Unported), 
as described at http://creativecommons.Org/licenses/by-nc/3.0/. 



592 Genome Research 

www.genome.org 



24:592-603 Published by Cold Spring Harbor Laboratory Press; ISSN 1088-9051/14; www.genome.org 



RLS-associated SNP alters telencephalic enhancer 



for genes implicated in transcriptional 
regulation and development (Woolfe 
et al. 2005). Since disease-associated var- 
iants identified by means of GWAS are 
more likely located in regulatory DNA 
(Maurano et al. 2012), we aim to study 
the causal role of MEIS1 in RLS by ana- 
lyzing the allele-dependent ds-regula- 
tory function of the common intronic 
RLS-associated DNA variants of MEIS1 
and differential binding of upstream fac- 
tors as well as RLS-related behavioral 
consequences of altered Meisl function. 

Results 

Identification of common candidate 
causal variants in the MEIS1 GWAS 
locus 

To capture common variation found in 
the MEIS1 GWAS locus, we screened the 
entire intronic 32-kb LD block associated 
with RLS in 188 RLS patients for variants 
using high-resolution melting curve 
analysis (Herrmann et al. 2006). Our dis- 
covery sample consisted of carriers of the 
known risk haplotype in order to enrich 
candidate causal variants residing on this 
haplotype. We focused our analysis on 
common variants with a minor allele 
frequency (MAF) > 0.05 due to our small 
sample size limiting the power for rare 
and low frequency variants. Of 136 vari- 
ants identified, 54 were common SNPs in 
our data set (MAF > 0.05) (Supplemental 
Table SI). To account for sampling bias in 
our discovery sample, we also included 
five variants reported as common in the 
general population (dbSNP Build 130), 
but with lower frequency in our data set, 
yielding a total of 59 variants. We geno- 
typed these in independent 735 cases 
and 735 age-, sex-, and ethnicity-matched 
population-based controls (KORA cohort) 
(Wichmann et al. 2005). Based on strong 
LD, a total of 16 variants were assayed 
using tagging SNPs. We then imputed 
missing genotypes in additional samples, 
leading to a final set of high-quality ge- 
notypes of 38 variants for statistical analy- 
sis in 1302 cases and 1259 controls. Logistic 
regression followed by permutation-based 
adjustment for multiple testing revealed 
27 associated SNPs (P corr < 0.05) (Supple- 
mental Table S2). The two strongest sig- 
nals were the known highly correlated 
rsl2469063 (P n0 m= 7.7 X 1(T 18 ; OR = 1.7) 
and rs2300478 (P nom = 4.2 X 1(T 18 ; OR = 
1.7) (Fig. IB). Conditional analysis with 
either rsl2469063 or rs2300478 as cova- 
riate did not reveal additional indepen- 
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Figure 1. Association signals, sequence conservation, and epigenetic signatures in embryonic and 
adult stages of the MEIS1 /Meisl locus (A) RLS-associated locus on 2p1 4 with the lead SNP rsl 2469063 
(violet diamond) and its genetic environment. (B) The x-axis represents the genomic position, re- 
ferring to the hgl 8 genome annotation. The right-hand /-axis represents the recombination rate, and 
the negative loglO of the nominal P-values of all SNPsgenotyped are given on the left-hand /-axis. The 
color-coded LD between SNPs is based on the lead SNP rsl 2469063 (violet diamond) as a reference. 
rs2300478, which lies in a nonconserved region, is depicted with a red circle. Recombination rate and 
r 2 values were estimated with the CEU population from HapMap II (release 22). The LD block below is 
based on HapMap data, measured in r 2 and visualized with Haploview. Highlighting refers to the 
magnitude of pairwise LD, with a white-to-black shading indicating lower to higher LD values. The 
lower part of Figure 1 B uses mouse genome annotation mm9 and lists from top to bottom: (1) con- 
servation between the mouse genome and the genome of the rat, human, frog, zebrafish, and Fugu; 
(2) corresponding localization of the human RLS-SNPs rsl 2469063 (violet diamond) and rs2300478 
(red circle); (3) seven analyzed HCNRs in turquoise boxes; (4) the conservation in mammals (dark 
blue); (5) DNA binding of EP300 (ChlP-seq) in El 1 .5 forebrain (blue) and midbrain (gray); and (6) 
DNase I hypersensitivity in whole brain of El 4.5 embryos and adult mice (red and orange peaks, 
respectively). 
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dent association signals within the 32-kb block after permutation- 
based adjustment but confirmed the known risk haplotype con- 
figuration rsl2469063/rs6710341. This haplotype showed an in- 
creased effect size with an OR of 2.33 compared to all other hap- 
lotypes with a frequency > 1% in our data set combined (95% 
confidence interval: 1.85-2.94 with an overall frequency [f] of 
0.138 defined by rs6710341 - allele A [f= 0.857] and rsl2469063 - 
allele G [f= 0.282]). Taken together, the common DNA variants 
rsl2469063 and rs2300478 remained the lead SNP pair within the 
32-kb block with the increased haplotype risk being compatible 
with additional small effect contributors not detectable by our 
approach. 

rs!2469063 impairs enhancer activity in zebrafish 

Given that ds-regulatory function is often found in HCNRs of 
developmental genes (Woolfe et al. 2005), we focused on variants 
found within these elements. Thus, we systematically examined all 
HCNRs in the associated LD block (Figs. 1, 2) in a zebrafish trans- 
genic EGFP-reporter assay (i.e., 24/48 h post-fertilization [hpf]). 
Altogether, eight HCNRs showed high conservation in mouse, 

A HCNR -EGFP expression . 
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HCNR ^™ ™ ^^^"^^ ■ 
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B rs1 2469063 - HCNR 6W-EGFP 




Figure 2. In vivo allele-specific enhancer function of HCNR 617 in 
zebrafish. (A) Results of the zebrafish enhancer screen with the expression 
domains for the protective and risk allele in orange and purple, re- 
spectively. (B) Representative embryos of the F1 generation with the 
protective ([A] adenine) and the risk allele ([G] = guanine) construct of 
HCNR 617 showed an allele-specific difference in reporter EGFP expres- 
sion in the neural tube. The protective allele of HCNR 61 7 drove EGFP 
expression in the retina, fore-, mid-, and hindbrain and spinal cord. The 
risk allele significantly reduced the expression almost to background levels 
(bar represents 250 |im). (C) Boxplot (left) The risk allele reduced the 
fluorescence intensity to 70% compared to the protective allele (100%) 
(Wilcoxon rank sum test, P=0.08; EGFP-brain signal measured, H P rotective = 
5, firisk = 5). The bar chart (right) represents the percentage of all EGFP- 
positive founders in the respective area (/-axis) with respect to the pro- 
tective (orange) and risk (purple) allele of HCNR 61 7. The risk allele leads 
either to a significant reduction or a complete loss of the EGFP signal. 



chicken, and frog, five of which were conserved also in the Fugu 
genome (HCNR 602, 606, 617, 628, and 629). Since one HCNR did 
not contain any common variants, we subjected the remaining 
seven HCNRs to further analysis (HCNR 602, 606, 612, 617, 622, 
629, and 631) (Fig. 2). None of the examined HCNRs contained 
any rare variants. While SNP rs 12469063 resided in one of the 
HCNRs (617), rs2300478 was located in a nonconserved stretch 
and therefore was not further examined (Fig. IB). 

Of all HCNRs assayed, solely HCNR 617 exhibited both 
a reproducible neural expression pattern and a genotype-specific 
expression comparing the protective and risk allele reporter con- 
structs (Fig. 2; Supplemental Fig. SI). With the protective allele, an 
enhancer signal was detected in the retina, the fore-, mid-, and 
hindbrain, and the spinal cord. In contrast, zebrafish embryos 
carrying the risk allele showed highly reduced enhancer activity 
(Fig. 2B,C). Together, these findings indicate that rsl2469063 lies 
within a region of high interspecies conservation with neural en- 
hancer activity and has an allele-specific functional impact. 

rs!2469063 alters enhancer activity in the murine 
embryonic ganglionic eminences 

Next, HCNR 617 was analyzed in a transgenic mouse beta- 
galactosidase (lacZ) enhancer assay (Pennacchio et al. 2006). 
We found a reproducible lacZ gene expression pattern for both 
alleles in the forebrain of all embryos (E12.5) within the ganglionic 
eminences (Fig. 3A,D; Supplemental Fig. S2), giving rise to the 
basal ganglia of the adult brain, in addition to parts of the amyg- 
dala and several interneuron populations (Medina and Abellan 
2012). Notably, lacZ reporter activity matched the endogenous 
Meisl telencephalic expression domain (Fig. 3D). In addition, tran- 
scripts of all further RLS-associated loci, Btbd9, Ptprd, Map2k5, and 
Tox3 (Winkelmann et al. 2011) were expressed in this telencephalic 
area (Fig. 3E). Given the allele-specific impaired EGFP expression in 
zebrafish, we looked for allele-specific differences of lacZ activity, re- 
lying on the assumption that both constructs have the same bias for 
copy number as suggested by the small variance of both groups. 
Transgenic embryos of Theiler stages 19 and 20 (Theiler 1989) were 
included in the evaluation of beta-galactosidase signal intensity and 
spatial expression measured according to Cavalieri (n = 4 for both 
the protective and risk allele) (Fig. 3A,B; Supplemental Figs. S2, S3). 
For risk allele carriers, analyses showed a significant reduction of 
65% in signal intensity (P = 0.029) and of 76% in the stereological 
analysis (P = 0.029) (Fig. 3A,B). 

Direct evidence for an allele-specific function of rs 12469063 
was further provided by electrophoretic mobility shift assays 
(EMSA). Incubation with nuclear protein extracts from forebrain of 
El 2.5 mice showed differential allele-specific gel shifts (Fig. 3C). 
The gel shifts were specific, as competition with the unlabeled al- 
lele in excess attenuated the differential bands. Combined with the 
results of the transgenic animal models, where the risk allele re- 
duces enhancer function, this implies an enhanced binding of an 
activator protein complex to the protective allele or a repressor 
protein complex to the risk allele. The lead SNP rs 12469063 was 
the only DNA sequence variation between the two oligonucleo- 
tides examined with EMSA, and thus held responsible for the dif- 
ferential DNA-protein complex formation (Fig. 3C). 

Since binding of the enhancer-associated epigenomic marker 
EP300 allows the prediction of tissue-specific enhancer activity 
(Visel et al. 2009), we assessed the 32-kb LD block for EP300 
binding in ChlP-seq data of murine brain (El 1.5). The only EP300 
peak corresponding to HCNR 617 was detected in forebrain tissue 
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Figure 3. In vivo and in vitro identification of allele-specific HCNR 61 7 enhancer function in the ganglionic eminences of the mouse. (A) Representative 
transgenic mouse embryos after pronucleus injection of HCNR 61 7 protective (left) and risk allele constructs (right) at stage El 2.5. Blue color indicates 
regions expressing the reporter gene beta-galactosidase (lacZ). Arrowhead indicates the reproducible telencephalic signal (bar represents 2.5 mm). 
(B) Analysis of the signal intensity in Theiler-staged embryos (stages 1 9 and 20; n protective = 4, n risk = 4) showed a reduction down to 35% (Wilcoxon rank sum 
test, P = 0.029). Stereological volume estimation according to Cavalieri revealed a significant volume reduction down to 24% for the risk allele (Wilcoxon 
rank sum test, P = 0.029). (C) Electrophoretic mobility shift assay using oligonucleotides encompassing rsl 2469063 showed allele-specific differences of 
DNA-protein complex formation using El 2.5 forebrain nuclear extract (lanes 3 and 4, arrow and arrowhead). Specificity is proven by competition with 
unlabeled risk and protective oligonucleotide in excess (lanes 6-18). CTRL, control (no protein). (D) Frontal sections through the forebrain reveal lacZ 
reporter activity in the mantle zone (mz) of the ganglionic eminences, in the same region as Meisl transcript and MEIS1 protein (arrowheads; bar 
represents 500 |im); no transcripts are visible in the ventricular (vz) and subventricular (svz) zones. (£) Transcripts of all additional GWAS-RLS risk loci map 
to the embryonic ganglionic eminences. mRNA of Ptprd, Btbd9, and Map2k5 was detected in the mantle zone (mz) of the LGE and MGE. Tox3 was 
expressed in the adjacent ventricular (vz) and subventricular (svz) zone (arrowheads; bar represents 500 urn). 
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(Fig. IB). Next, we analyzed whether HCNR 617 is also an active 
enhancer during adulthood by using DNase I hypersensitivity 
maps, indicating active regulatory regions (Sabo et al. 2006). This 
analysis further supported the active status of HCNR 617 during 
development (E14.5), whereas in adult brain tissue (8 wk), no ac- 
tivity of HCNR 617 could be identified (Fig. IB). Together, these 
observations underline the spatiotemporal enhancer specificity of 
HCNR 617 and strongly suggest MEIS1 regulation in RLS during 
development rather than in adult tissue homeostasis. 

rs!2469063 affects a cAMP response element-binding 
protein /CREB1 transcription factor binding site 

To identify upstream transcription factors binding the sequence 
motif altered by rs 12469063, we carried out affinity chromatog- 
raphy using allele-specific DNA oligonucleotides and nuclear 
protein extracted from E12.5 mouse embryonic telencephalon. 
Nuclear proteins bound to the affinity matrix were identified by 
mass spectrometry, and protein levels were quantified allowing 
calculation of a ratio reflecting binding preference (Fig. 4A). A total 
of 51 identified proteins demonstrated allele-specific differential 
binding (mean fold change > 1.5) (Supplemental Table S3), with 
33 showing consistent results across three independent experi- 
ments. Among these, CREB1 was the protein with the highest fold 
change, also known to bind DNA in a sequence-specific manner 
and of functional relevance in the CNS. We observed a strong 
similarity of the genomic sequence spanning rs 12469063 and the 
consensus CREB binding site (Fig. 4B) and found Crebl /CREB1 to 
colocalize with Meis 1 /MEIS1 in the mantle zone of the ganglionic 
eminences at both transcript and protein levels (Fig. 4C,D). 
Moreover, rs 12469063 influenced the predicted CREB1 transcrip- 
tion factor interaction based on in silico analysis (Genomatix, 
SNPInspector). Furthermore, supershift experiments with a CREB- 
specific antibody attenuated the gel shift (Fig. 4E). As the super- 
shifted band showed a higher intensity signal for the risk allele, 
and mass spectrometry results showed that CREB1 binds 3.2-fold 
higher to the risk allele (Supplemental Table S3), we suggest that 
CREB1 binds more strongly to the risk allele. Finally, disruption of 
the entire predicted CREB1 binding site resulted in the loss of the 
specific gel shift (Fig. 4E). Combined, these data sets identified 
CREB1 as an upstream factor differentially binding rs 12469063. 

MEIS1 downstream target genes in the E12.5 
ganglionic eminences 

To model the reduced expression of MEIS1 in RLS, we chose mice 
heterozygous for the Meisl tmlMtor allele. These mice express only 
one functional allele of Meisl and should thus recapitulate re- 
duced MEIS1 expression in RLS patients. Whereas homozygous 
Meisl tmlMtor mice are embryonic lethal by E14.5, heterozygous 

Mdsl tmlMtor mice viable fertne ( AzCQitia et al 2005). TO 

identify genes that are directly or indirectly regulated due to re- 
duced Meisl expression in the LGE/MGE, we performed whole 
genome transcriptomics of forebrains from wild-type and hetero- 
zygous Meisl tmlMtor embryos (E12.5) (Fig. 5). Based on significance 
analysis of microarrays (SAM), we found 126 significantly regulated 
genes in male and 161 in female embryos (mean fold change > 1.5; 
false discovery rate [FDR] < 10%). Hierarchical cluster analysis of 
genes differentially regulated (unpaired two class analysis [SAM] 
of 285 genes; FDR < 7.9%; fold change > 1.5) revealed that the ex- 
pression profiles clearly cluster according to the respective sex 
(Fig. 5; Supplemental Table S4). Using pathway analysis software 



(Ingenuity), we identified significantly overrepresented func- 
tional annotations in male and female embryos. Despite the dif- 
ferences, we found that in both sexes the functional categories 
"neurological disease" and "nervous system development" were 
among the most significant overrepresented biological processes 
(Table 1). In addition, the occurrence of the "cardiovascular disease" 
category was in agreement with the reported function of Meisl 
during vasculogenesis (Azcoitia et al. 2005). Of note, no transcripts 
of the currently known RLS-associated risk loci, Btbd9, Ptprd, 
Map2kS, and Tox3, were regulated, although transcripts belonging 
to homologous genes, such as Ptprv, Ptprc, and Map3kl2, were dif- 
ferentially expressed. Interestingly, the homeobox gene Gbx2, 
which is essential for the proper development of cholinergic in- 
terneurons in the striatum (Chen et al. 2010), was found to be 
down-regulated (Table 1; Supplemental Table S4). Altogether, these 
data correlate with known MEIS1 function regulating transcrip- 
tion of developmental genes rather than housekeeping genes 
(Penkov et al. 2013). 

Motor restlessness /hyperactivity in heterozygous 

Meisl tm1Mtor mice 

Given the reduced reporter gene activity in the endogenous Meisl 
telencephalic expression domain, we also aimed to model the 
functional effect of reduced Meisl expression on behavior and 
analyzed adult heterozygous Meisl tmlMtor mice (Azcoitia et al. 
2005). In the open field test measured over 20 min, we found 
a significantly increased total distance traveled and average speed 
of forward locomotion in heterozygous compared to wild-type 
mice (P = 0.003 and P = 0.004) (Fig. 6A,B). No changes were ob- 
served with respect to anxiety-related behavior (Supplemental Fig. 
S4). Moreover, we found a consistent trend of elevated average 
distance traveled measured over a period of 21 h (up to 16.3%, P = 
0.06) (Fig. 6D) with corresponding enhanced energy metabolism 
in the mutants measured by indirect calorimetry (Fig. 6C; Supple- 
mental Table S5). The oxygen consumption adjusted to the body mass 
differences was significantly increased in the mutants (adjusted mean 
VO z : males plus 8.5%, females plus 5.5%, P ge notype = 0.007; adjusted 
minimum V0 2 : males plus 7.8%, females plus 4.3%, Pgenotype = 0.04; 
adjusted maximum V0 2 : males plus 6.5%, females plus 10.7%, 
^genotype = 0.003) (Fig. 6C). Taken together, we consider these findings 
as an in vivo demonstration that reduced expression of Meisl is leading 
to an alteration of the behavior toward hyperactive locomotion. 

Discussion 

In the present study, we demonstrated the allele-specific impairment 
of an intronic ds-regulatory element by the common, RLS-associated 
SNP rs 12469063 in two independent in vivo experimental systems. 
In mice, the risk allele led to a significant reduction of the enhancer 
activity in the LGE and MGE, suggesting this brain area as RLS-rele- 
vant. We concentrated on genetic variation in highly conserved 
noncoding regions assuming subsequent alteration in gene expres- 
sion contributing to complex disease (Maurano et al. 2012). Non- 
coding sequence conservation has been shown to be capable of 
identifying human ds-regulatory elements (Prabhakar et al. 2006). 
Therefore, we might have missed functional effects of SNPs mediated 
through other mechanisms such as microRNA regulation, coding 
SNPs as well as noncoding regulatory elements lacking conservation. 

Meisl expression corresponding to the region of lacZ reporter 
activity strongly points to Meisl being regulated by the identified 
enhancer. Moreover, a bioinformatic approach for delineating 
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Figure 4. rsl 2469063 alters a CREB1 transcription factor binding site. (A) Principle work flow for the identification of upstream binding factors by 
affinity chromatography and mass spectrometry. (B) CREB consensus sequence compared to the human sequence spanning rsl 2469063. (C,D) Creb/CREB 
and Me/s7/MEIS1 were both detected in the mantle zone of the ganglionic eminences and show colocalization (bar in C represents 500 urn; bar in D 
represents 250 urn). (D) Magnification of rectangle in C. (£) Supershift EMSA assay with a specific antibody against CREB showed significant reduction of 
DNA-protein complex formation compared to unspecific IgG antibody (control) for overexpressed CREB1 in 293T nuclear cell lysate (lanes 7-6) and El 2.5 
forebrain nuclear extract (lanes 7-12, arrowhead). Complete abolishment of specific binding with an oligonucleotide, in which the entire CREB motif is 
deleted (lane 1 3). CREB consensus oligonucleotide shows specific binding, being absent with the mutated CREB consensus oligonucleotide (lanes 14,15). 
Arrow indicates additional allele-specific band. 



enhancer-gene interactions ("PreSTIGE") (Corradin et al. 2014) 
and Poll II ChlA-PETand Hi-C experiments in two human cell lines 
(MCF7 and IMR-90) (Dixon et al. 2012; Li et al. 2012) demon- 



strated an exclusive interaction of the RLS-associated intronic re- 
gion with the regulatory landscape of MEIS1 (Supplemental Table 
S6; Supplemental Fig. S5; Supplemental Methods). 
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Figure 5. MEIS1 transcriptional target genes in the El 2.5 ganglionic 
eminences. Summarized heat map from unpaired two class analysis 
(SAM) of genes regulated between heterozygous Meisl tmlMtor and wild- 
type forebrain tissues at El 2.5. Expression profiles of male and female 
embryos clearly separate. Similar gene profiles are grouped together (rows) 
and color code gives the mean fold changes of the respective genes for each 
mutant embryo. Yellow represents up-regulation and blue represents 
down-regulation in comparison to the respective wild-type control group. 
The number of genes that are included in each group (A-G) with similar 
patterns is given. Individual genes are listed in Supplemental Table S4. 



The LGE/MGE give rise to the prospective basal ganglia, i.e., 
pallidum and striatum with several projection neuron and in- 
terneuron populations, selected parts of the amygdala, cortical, 
and olfactory interneurons (Medina and Abellan 2012). Concerning 
RLS, numerous neuroanatomical regions have been suggested to be 
involved (Dauvilliers and Winkelmann 2013). The functional rele- 
vance of the RLS-SNP rsl2469063 in the LGE/MGE suggests that RLS 
can be regarded as a "basal ganglia disorder," although further 
neural structures and pathways might be involved. 

The activity of the enhancer during development, indicated 
by the loss of DNase I hypersensitivity signal in HCNR 61 7 in adult 
brain, argues for a role restricted to embryonic development. Our 
data emphasizes a new focus for RLS research and suggests that the 
pathophysiology of RLS includes aspects of a neurodevelopmental 
disorder. As aging is the strongest risk factor for RLS, it is possible 
that early neurodevelopmental alterations in the LGE/MGE might 
not be compensated later in life (Somel et al. 2010). The reduced 
Meisl expression might lead to altered neuronal pathway forma- 
tion, including either striatal projections or interneurons, as sug- 
gested by down-regulation of Gbx2 in heterozygous Meisl tmlMtor 
mice. Gbx2 is required for the development of cholinergic in- 
terneurons in the striatum (Chen et al. 2010). Transcriptome 
profiling identified further MEIS1 downstream targets involved 
in specification of progenitor cells in the ganglionic eminences 
(Tucker et al. 2008), as well as protein tyrosine phosphatases (Ptprv, 
Ptprc), which might have redundant functions next to the RLS- 
associated PTPRD on 9p23-24 (Schormair et al. 2008). 

By affinity chromatography and mass spectrometry, we 
identified the transcription factor CREB1 as a potential upstream 
factor, binding stronger to the risk allele of rs 12469063 in in vitro 
assays. Since sensitivity of mass spectrometry-based detection of 
differentially binding proteins is limited due to low protein 
abundances, it is possible that our analysis did not cover all dif- 



ferential binding proteins. Along this line, an additional specific 
gel shift in the EMSA (Fig. 5E) provides evidence for a further 
protein complex with increased binding to the protective allele. 
CREB1 either activates or inactivates the transcription of down- 
stream target genes (Bartsch et al. 1998). Here, we observed an 
enrichment of CREB1 binding and a reduction of the enhancer's 
activity for the risk allele, implying a function of CREB1 as a tran- 
scriptional repressor affecting rs 12469063. 

Based on the zebrafish reporter screen, EP300 ChlP-seq, DN- 
ase I hypersensitivity data, and the association study, we focused 
on HCNR 617. Both animal models showed allele-dependent al- 
tered enhancer function, but the domain in the teleost central 
nervous system covered additional caudal structures such as the 
spinal cord. It has been shown that orthologous sequences with 
a high evolutionary distance do not necessarily retain the same 
activity in their respective species (Schmidt et al. 2010). Besides 
HCNR 617, additional ds-regulatory elements might be involved 
in the regulation of MEIS1 in relation to RLS. Thus, our HCNR 
zebrafish screen identified two additional transcriptionally active 
enhancers (HCNR 629 and 631) showing neural allele-dependent, 
but spatially not reproducible reporter activity, previously referred 
to as booster activity (Fig. 2; Supplemental Fig. SI; Royo et al. 
2012), as well as one HCNR with general, yet allele-independent 
booster activity (HCNR 602). Moreover, the DNase I hypersensi- 
tivity data provided evidence for potential ris-regulatory activity of 
further HCNRs (HCNR 602, 606, and 622), two of which comprise 
RLS-associated variants (HCNR 602 and 622). It has been shown 
that genetic variants in strong LD map to clusters of enhancers, 
which cooperatively affect gene expression and thus confer 
concerted susceptibility to disease (Corradin et al. 2014). Given 
our results, such modular organization with multiple enhancers 
next to the identified enhancer HCNR 617 might play a role in 
the pathophysiology of RLS. This concept is in line with the 
identified higher-risk haplotype for RLS, defined by rs 12469063 
and rs6710341. For all associated common SNPs in all seven 
HCNRs, this haplotype carries the respective risk alleles, suggesting 
a possible functional role of these variants next to rsl 2469063 
within the MEIS1 locus. 

Setting the focus of our study on common variants within 
HCNRs increased its power but also entailed some limitations. We 
focused strictly on the 32-kb block of strong LD identified in the 
initial RLS GWAS (Winkelmann et al. 2007), which is embedded 
in a larger 64-kb block of weaker LD, and consequently have not 
considered variants in long-distance LD with the lead SNPs. In 
addition, rare and low-frequency variants as well as other types of 
genetic variation have not been analyzed. However, our functional 
zebrafish screen clearly identified HCNR 617 containing the RLS- 
associated SNP rsl 2469063 as the only allele-dependent enhancer, 
highlighting its functional importance in RLS. 

Behavioral consequences of reduced Meisl expression 
showed hyperactivity and increased energy expenditure in mutant 
mice. Assessing however, a murine phenotype for alterations of 
a behavioral pattern is challenging. Here, the observed behavior of 
mutant mice correlates well with the human phenotype hall- 
marked by an enhanced activity and motor restlessness. We cannot 
exclude that additional factors modify the observed phenotype 
as MEIS1 is a pleiotropic protein and also expressed in other 
movement-related CNS regions. Yet, our findings support a loss-of- 
function model as a disease mechanism. This is in line with loss-of- 
function models in mouse and Drosophila for the RLS-associated 
locus BTBD9, also demonstrating hyperactive behavior (DeAndrade 
et al. 2012; Freeman et al. 2012). 
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Table 1. MEIS1 target genes in the ganglionic eminences of male and female E12.5 heterozygous Meis 1 tm 1 Mtor mice 
GO biological process 



Category Subcategory P-value Genes 



Male 

Neurological disease Progressive motor neuropathy 4.72 

Schizophrenia 6.68 

Amyotrophic lateral sclerosis 9.93 

Neurological signs 1.19 

Multiple sclerosis 1.74 

Neurodegeneration 1 .84 

Seizure disorder 2.52 

Dyskinesia 2.64 

Rett Syndrome 2.88 

Movement disorders 2.92 

Neuromuscular disease 3.10 

Nervous system Synaptic transmission 1 .50 

development Long-term potentiation 3.46 

Morphology of cerebellum 4.38 

Morphology of central 4.62 
nervous system 

Cardiovascular disease Vascular disease 3.39 

Stroke 6.12 

Thrombosis 8.06 

Occlusion of blood vessel 2.59 

Infarction 2.64 

Female 

Neurological disease Encephalitis 3.73 

Nervous system Abnormal morphology 1.27 

development of vertebrae 

Behavior Maternal nurturing 2.68 

Memory 1.35 

Cardiovascular Mass of heart 1 .1 7 

system and disease Myocardial infarction 3.70 

Vasculogenesis 3.94 



x 10 03 Mapt, Nrxnl, Ptprc, Rbfoxl, Slclal, Slc24a3, Sv2a, Tceal 

x 10 03 Aldh3b1, Chrnb3, Nrxnl, Rbfoxl Mt2, Slclal S\c25a27, 

Stmn2, Vsnll 
x 10" 03 Nrxnl Rbfoxl Slclal Tceal 

x 1(T 02 Ghrnb3, Fbxw7, Map3kU, Mapt, Pde4dip, Ptprv, Rbfoxl, 

Slclal, Sv2a, Vsnll 
x 1 0 02 Mapt, Ptprc, Slc24a3, Sv2a 

x 1 0 02 Idua, Mapt, Slclal, Ube3a 

x 10" 02 Mapt, Nrxnl, PdeAdlp, Ptprc, Rbfoxl, Sv2a, Ube3a 

x 1(T 02 Chrnb3, Fbxw7, Map3kU, PdeAdlp, Ptprv, Rbfoxl, Slclal, 

Sv2a, Vsnll 
x 1 0 02 PdeAdlp, Ube3a, Zfp36ll 

x 1(T 02 Chrnb3, Fbxw7, Hcrtrl, Idua, Map3kl2, Mapt, PdeAdlp, 

Ptprv, Rbfoxl, Slclal, Sv2a, Ube3a, Vsnll 

x *\0~ 02 Fbxw7, Map3kl2, Mapt, PdeAdlp, Ptprc, Ptprv, Rbfoxl, 

Slclal, Slc2Aa3, Sv2a, Vsnll 

x 1 0 03 Chrnb3, Hcrtrl, Mapt, Nrxnl, Rit2, Slclal, Sv2a 

x 1 0 02 lapp, Mapt, Slclal, Ube3a 

x 1 0~ 02 Gbx2, Idua, Ube3a 

x 10 02 Gbx2, Idua, Irx6, Map3kl2, Mapt, Ptprc, Ube3a 

x 10 04 Chdl3, Chrnb3, Gbx2, Gp5, Mapt, Mmpl3, Mpl, Pcsk2, 

Rbfoxl, Stmn2, Sv2a, If pi, TnfsA, Vsnll 
x 1 (T 03 Chrnb3, Mapt, Sv2a, Vsnll 

X 1 0" 03 Gp5, Mpl, Stmn2, Tfpi 

x 1(T 02 Gdhl3, Gp5, Mapt, Mmpl 3, Pcsk2, Rbfoxl, TnfsfA 

x 1(T 02 Hpgds, Mapt, Slcl6a7, Tfpi, TnfsfA 

x 1 0 03 Abcg2, Gblb, Esrl, Hspa5, Ncfl, Ppp3ca, Rorc 

x 1 0 02 Esrl, Hells, Hoxa3, Plagll 

x 1(T 03 Esrl, Foxbl, Npas3 

x 1 0 02 Esrl, Foxbl, Ncfl, Npas3, Ppp3ca 

x 1 0 02 Esrl, Klk3, Plagll, Ppp3ca 

x 1 0 02 Esrl, Itgb2, Masp2, Ppp3ca 

x 10 02 Ada, Bmx, Esrl, Folhl, Itgb2, Klk3, Ltbpl, Slc9a3r2, 

Tiparp, Wars2 



Significantly overrepresented functional annotations among regulated genes of heterozygous mutant embryos are listed for male and female mice. 
Analysis was done using the Ingenuity pathway analysis tool. 



Recently, thousands of common variants have been identified 
for common traits causing a big debate about the underlying bi- 
ology of common genetic variants conveying only small effect 
sizes. The greatest challenge in the "post-GWAS" era is to under- 
stand the functional consequences of these loci. Here, we showed 
a SNP-dependent enhancer impairment, suggesting RLS-relevant 
differences in the regulation of downstream target genes in the 
embryonic ganglionic eminences, the prospective basal ganglia. 
This supports the concept of RLS having elements of a neuro- 
developmental disorder. Now, for the first time, we are getting closer 
to a better understanding of the neuronal pathways involved in 
RLS, enabling us to explore new drugs and treatment strategies in 
the future. 

Methods 

Mutational screening, SNP discovery panel 

One hundred eighty-eight unrelated RLS-affected individuals 
(72.8% female, mean age 60.0 ±11.2 yr) were screened for variants; 
94 patients were homozygote for the risk allele of rs 12469063 
(G/G), and 94 patients were heterozygote (G/A). We used high- 



resolution melting curve analysis with LightScanner (IDAHO 
Technology, Inc.). The targeted region was the 32-kb LD block 
(hgl8; chr2: 66,601,989-66,635,212) as defined by the fine- 
mapping of the original GWAS signal using tagging-SNPs of the 
MEIS1 gene and ± 10 kb 5' and 3' of sequence in 903 cases and 
891 controls of European ancestry (Winkelmann et al. 2007). 
Oligonucleotide sequences for 145 amplicons are available in 
Supplemental Table S7. DNA was analyzed in doublets to prevent 
false-positive signals. Samples with aberrant melting patterns 
were Sanger sequenced. 

Association study: case /control population, imputation, 
and statistical analysis 

Genotyping was performed in 735 unrelated RLS-affected in- 
dividuals (70.8% female, mean age 61.5 ± 14.2 yr) from Germany, 
and 735 age-, sex-, and ethnicity-matched population-based con- 
trols of the KORA cohort (74.5% female, mean age 59.8 ± 11.3 yr) 
using the MassARRAY system and Sequenom iPLEX Gold tech- 
nology (Sequenom, Inc.). Oligonucleotide sequences are available 
in Supplemental Table S8. A total of 59 variants were selected for 
genotyping (Supplemental Table SI; Supplemental Methods). 



Genome Research 599 

www.genome.org 



Spieler et al. 



Total distance 



B Average speed 



E 40000 
o 

30000 
20000 
10000 



~Z_ E 



10- 



T 



c 




eg 




E 


120- 


£= 




TO 




c 






100- 


1 






80- 




60- 



wildtype heterozygous 
genotype 

Average V02 

female heterozygous 

♦ female wildtype 

♦ male heterozygous 

♦ male wildtype ^ * 



wildtype heterozygous 
genotype 



D Average distance 




20 



-I - 

25 



I 

30 



I 

35 



40 



Average body weight [g] 



Figure 6. Motor restlessness/hyperactivity in adult heterozygous 
Meisl tm1Mtor m\ce. Heterozygous Meisl tml ' Mtor (net) traveled a higher total 
distance (P= 0.003, two-tailed t-tests, n = 29) (A) and moved with a higher 
speed on average (P = 0.004, two-tailed t-tests, n = 30) (B) in a 20-min 
open field test for spontaneous locomotor activity in a novel environment. 

(C) Oxygen consumption (mL/h) plotted versus body mass (g). Scatter- 
plot with regression lines split by sex and genotype to adjust for body mass 
variation. The shift in regression lines indicates higher energy turnover 
in heterozygous Meisl tmlMtor mice (linear regression model, P = 0.0005). 

(D) Mean distance traveled (cm/20 min) monitored by infrared light beams 
during the indirect calorimetry trial. Heterozygous Meisl tmlMtor mice 
tended to show increased locomotor activity under home cage conditions 
(two-way ANOVA, P = 0.06). 



SNPs were imputed in a larger RLS case/control (1302/1259) 
set using MACH 1.0 (http://www.sph.umich.edu/csg/abecasis/ 
MACH/index.html) and standard settings. Post-imputation qual- 
ity control exclusion criterion was an Rsq score < 0.3. Statistical 
analysis was performed as logistic regression with disease status as 
the dependent and allele dosage as the independent variable using 
gender as a covariate. All calculations were done using R 3.0.1. For 
the conditional analysis, either rs2300478 or rs 12469063 together 
with rs6710341 was entered as an additional covariate. Westfall- 
Young minimal-P method was used to correct for multiple testing 
with n = 10,000 permutations, and the significance level was set at 
0.05 (Westfall and Young 1993). The plot for Figure 1 was gener- 
ated using the UCSC Genome Browser (http://genome.ucsc.edu/), 
LocusZoom vl.l (Pruim et al. 2010), and Haploview 4.2 based on 
HapMap data (CEU) (The International HapMap Consortium 
2003). Haplotype analysis was performed in the genotyped data set 
only, using both Haploview 4.2 (Barrett et al. 2005) (http:// 
www.broadinstitute.org/scientific-community/science/programs/ 
medical-and-population-genetics/haploview/haploview) and 
UNPHASED 3.1.7 (Dudbridge 2008) (https://sites.google.com/site/ 
fdudbridge/software/unphased-3-1). Haploview 4.2 using the 
method by Gabriel et al. (2002) showed that there was one risk 
haplotype significantly enriched (or depleted) (assessed at a 5% 
level after 20,000 permutations) in these cases. This haplotype 
was perfectly tagged by rs2300478/ rsl2469063 and rs6710341. 
For statistical testing and estimation of ORs, we used UNPHASED 
3.1.7 including gender as a covariate. 



Genomatix software 

Matlnspector and SNPInspector software (Cartharius et al. 2005) 
were employed to predict putative transcription factor binding 
sites and to analyze in silico SNP dependent differences, respec- 
tively. GeneRanker software (Berriz et al. 2003) was used for char- 
acterization of large sets of genes using annotation data from 
various sources, such as Gene Ontology or Genomatix proprietary 
annotation. 



EP300 ChlP-seq and DNase I hypersensitivity profile 

EP300 ChlP-seq data for forebrain of El 1.5 murine embryos was 
mapped to all HCNRs of the LD block examined (Visel et al. 2009). 
We used the ENCODE/University of Washington "UW DNase I HS 
track/' available online in the UCSC Genome Browser (http:// 
genome.ucsc.edu/) (Sabo et al. 2006). 



HCNR selection and cloning of HCNR reporter vectors 

For determination of HCNRs, we used the VISTA Genome Browser 
v2.0 (http://pipeline.lbl.gov/cgi-bin/gateway2) (Dubchak et al. 
2000). HCNR selection settings are given in the Supplemental 
Methods. HCNRs were amplified from human DNA. For each 
HCNR, two constructs with the protective and risk alleles of the 
associated SNPs were cloned. For common SNPs located in HCNRs, 
but not genotyped successfully, the respective construct included 
either the allele enriched in the discovery sample (rsl 13851554, 
allele T) or the allele correlated with the risk allele of a tagging 
SNP (rsl 18971 19, allele T) as risk allele. All vectors were Sanger 
sequenced and did not include any additional rare variants. 
Primer sequences and vectors are detailed in the Supplemental 
Methods. 



Generation and analysis of transgenic animals 

In zebrafish, microinjections and screening was performed as 
previously reported (Royo et al. 2011). Transgenic mouse lines 
(C57BL/6, Harlan) were generated by pronucleus injection of lin- 
earized vectors. Primer sequences used for genotyping are detailed 
in the Supplemental Methods. Intensity of beta-galactosidase 
staining was measured using ImageJ software (http://rsbweb. 
nih.gov/ij/index.html) on pictures of entire embryos (n protective = 4; 
^risk = 4). High-precision design-based stereology was used to 
quantify the volume of the stained area of the ganglionic emi- 
nences (^protective = 4; H risk = 4). Volume was determined on S-[im 
sections on a stereology workstation (Stereo Investigator, 
MicroBrightField). On every fourth section, the area of interest 
was counted using the Cavalieri estimator probe (scan grid size = 
20 urn; height of the unbiased virtual counting spaces = 12 urn). 

Histological methods 

Embryos were immersion-fixed in 4% paraformaldehyde/PBS. Ra- 
dioactive in situ hybridization was performed on 8-|im paraffin 
sections as previously described (Giesert et al. 2013). Immunohis- 
tochemistry on paraffin sections was performed according to stan- 
dard procedures. Templates for in vitro transcription of riboprobes 
and antibodies used are detailed in the Supplemental Methods. 

Electrophoretic mobility shift assay (EMSA) 
and overexpression of CREB1 in 293T cells 

For EMSA experiments, forebrain nuclear extracts of E12.5 mouse 
embryos and CREB1 protein overexpressed in 293T cells were used. 
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Experimental details and 5'-Cy5-labeled oligonucleotides are 
given in the Supplemental Methods. 

Affinity chromatography and mass spectrometry 

Nuclear protein extracts from forebrain of E12.5 mouse embryos 
were incubated with double-stranded biotinylated oligonucleotide 
probes. The sequence of the double-stranded biotin-labeled olig- 
omers used was the same as in EMSA, replacing Cy5 with biotin 
labeling. Proteins were eluted with an ascending salt concentra- 
tion (200-1250 mM NaCl). The affinity chromatography was per- 
formed three times, and the differential elution fractions were 
analyzed with mass spectrometry. Proteins were proteolysed with 
trypsin, and LC-MS/MS analysis was performed with an HPLC 
system (Ultimate3000, Dionex), directly coupled to an LTQ Orbi- 
trap XL (ThermoFisher Scientific) as described previously (Merl 
et al. 2012). For quantitative analyses, spectra were loaded into the 
Progenesis LC-MS software (version 4.0, Nonlinear) and analyzed 
as previously described (Hauck et al. 2010). Peptide identifications 
were performed using Mascot (Matrix Science, version 2.3) and the 
Ensembl mouse database (Release 62; 54,576 sequences) with one 
missed cleavage allowed, a parent ion tolerance of 10 parts per 
million (ppm), and a fragment ion mass tolerance of 0.6 Da. 
Carbamidomethylation was set as fixed; methionine oxidation, 
and asparagine or glutamine deamidations were allowed as vari- 
able modifications. A Mascot-integrated decoy database search 
calculated an average peptide FDR of < 2%. Peptide assignments 
were reimported into Progenesis LC-MS. Normalized abundances 
of all unique peptides were summed up and allocated to the re- 
spective protein, comparing the protective (NR) with the risk (R) 
condition. 



RNA isolation and transcriptome analysis 

Total mRNA was extracted from forebrains of E12.5 mouse em- 
bryos (wild-type and Meis 1 tmlMtor/ - mice), and transcriptomes were 
analyzed using MouseRef-8v2 Expression BeadChips (Illumina). 
Details of statistical, cluster, and pathway analyses are given in the 
Supplemental Methods. 

Behavioral and metabolic phenotyping of Meisl tmJMtor/ ~ mice 

Previously described Meisl tmlMtor mice were phenotyped at the 
German Mouse Clinic (Azcoitia et al. 2005; Gailus-Durner et al. 
2009). Experimental settings of reported findings are detailed in 
the Supplemental Methods. 

Ethics 

The KORA study was reviewed and approved by the local ethical 
committee (Bayerische Landesarztekammer). All animal work was 
performed in accordance with the German Animal Welfare Act and 
the Spanish Ethical Committee for Animal Research from Consejo 
Superior de Investigaciones (CSIC). Approval for the generation 
of the transgenic mice was obtained from the Landesdirektion 
Dresden (Az. 24-9168.11-9/2005-1). 

Data access 

The complete cRNA anay data set from this study has been submitted 
to the NCBI Gene Omnibus (GEO; http://www.ncbi.nlm.nih.gov/ 
geo/) under accession number GSE44592. All novel variants have 
been submitted to NCBI dbSNP (https://www.ncbi.nlm.nih.gov/ 
SNP/). Their ss-numbers are given in Supplemental Table SI. 
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